# Continuous Action Space
Ppo LunarLander V2
This is a reinforcement learning model based on the PPO algorithm, specifically trained for the LunarLander-v2 environment to safely control lunar landings.
Physics Model
P
sofiascat
14
1
Ppo LunarLander V2
This is a reinforcement learning model based on the PPO algorithm, specifically designed to solve the landing task in the LunarLander-v2 environment.
Physics Model
P
tooalvin
13
1
Mlagents Crawler
This is a PPO agent model trained using the Unity ML-Agents library, specifically designed for reinforcement learning tasks in the Crawler environment.
Molecular Model
TensorBoard

M
infinitejoy
21
0
Ppo LunarLander V2
This is a reinforcement learning model based on the PPO algorithm, designed to solve control tasks in the LunarLander-v2 environment.
Physics Model
P
sigalaz
20
0
Ppo LunarLander V2
This is a reinforcement learning model based on the PPO algorithm, specifically trained for the LunarLander-v2 environment to control the safe landing of a lunar lander.
Physics Model
P
andri
16
0
Sac Walker2d V3
This is a reinforcement learning model based on the SAC algorithm, specifically designed for the Walker2d-v3 environment to control bipedal robot walking.
Physics Model
S
sb3
43
0
Assignment2 Omar
This is a reinforcement learning model based on the PPO algorithm, specifically designed to solve the landing task in the LunarLander-v2 environment.
Physics Model
A
Classroom-workshop
135
3
Td3 Hopper V3
This is a TD3 agent model trained using the stable-baselines3 library, specifically designed for reinforcement learning tasks in the Hopper-v3 environment.
Physics Model
T
sb3
30
0
Ppo Hopper V3
This is a PPO reinforcement learning model trained based on the stable-baselines3 library, specifically designed for continuous control tasks in the Hopper-v3 environment.
Physics Model
P
sb3
19
0
Ppo HalfCheetah V3
This is a reinforcement learning model based on the PPO algorithm, specifically designed for the HalfCheetah-v3 environment and trained using the stable-baselines3 library.
Physics Model
P
sb3
51
1
Ppo MountainCar V0
This is a deep reinforcement learning model based on the PPO algorithm, specifically designed to solve control problems in the MountainCar-v0 environment.
Physics Model
P
sb3
21
1
Sac Pendulum V1
This is a reinforcement learning model based on the SAC algorithm, designed to solve control problems in the Pendulum-v1 environment.
Physics Model
S
sb3
39
0
PPO LunarLander V2
This is a reinforcement learning model based on the PPO algorithm, specifically trained for the LunarLander-v2 environment to safely control the lunar lander.
Physics Model
P
BioGeek
102
0
Dqn MountainCar V0
This is a DQN agent model trained using stable-baselines3, specifically designed to solve reinforcement learning tasks in the MountainCar-v0 environment.
Molecular Model
D
sb3
578
1
Ppo Pendulum V1
This is a reinforcement learning model based on the PPO algorithm, specifically designed to solve control problems in the Pendulum-v1 environment.
Physics Model
P
sb3
51
2
Ball
This is a reinforcement learning agent trained with the PPO algorithm, designed to control the balancing ball task in the Unity 3DBall game.
3D Vision
TensorBoard

B
ThomasSimonini
23
0
Decision Transformer Gym Hopper Medium
This is a decision transformer model trained on medium-performance trajectories in the Gym Hopper environment, suitable for continuous control tasks.
Physics Model
Transformers

D
edbeeching
6,518
6
Featured Recommended AI Models